
    From average case complexity to improper learning complexity

    The basic problem in the PAC model of computational learning theory is to determine which hypothesis classes are efficiently learnable. There is presently a dearth of results showing hardness of learning problems. Moreover, the existing lower bounds fall short of the best known algorithms. The biggest challenge in proving complexity results is to establish hardness of {\em improper learning} (a.k.a. representation-independent learning). The difficulty in proving lower bounds for improper learning is that the standard reductions from $\mathbf{NP}$-hard problems do not seem to apply in this context. There is essentially only one known approach to proving lower bounds on improper learning. It was initiated in (Kearns and Valiant 89) and relies on cryptographic assumptions. We introduce a new technique for proving hardness of improper learning, based on reductions from problems that are hard on average. We put forward a (fairly strong) generalization of Feige's assumption (Feige 02) about the complexity of refuting random constraint satisfaction problems. Combining this assumption with our new technique yields far-reaching implications. In particular: 1. Learning $\mathrm{DNF}$s is hard. 2. Agnostically learning halfspaces with a constant approximation ratio is hard. 3. Learning an intersection of $\omega(1)$ halfspaces is hard. Comment: 34 pages

    Fake View Analytics in Online Video Services

    Online video-on-demand (VoD) services invariably maintain a view count for each video they serve, and this count has become an important currency for various stakeholders, from viewers to content owners, advertisers, and the online service providers themselves. There is often a significant financial incentive to use a robot (or a botnet) to artificially create fake views. How can we detect fake views? Can we detect them (and stop them) with online algorithms as they occur? What is the extent of fake views among current VoD service providers? These are the questions we study in this paper. We develop several algorithms and show that they are quite effective for this problem. Comment: 25 pages, 15 figures
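
    The abstract does not disclose the detection algorithms themselves, so the following is only an illustrative sketch of one way an online detector could work: flag a video when its instantaneous view rate jumps far above an exponentially smoothed baseline, the kind of burst that robot-driven fake views tend to produce. The smoothing factor and threshold below are arbitrary placeholders, not values from the paper.

    # Hypothetical online burst detector for view-count streams; NOT the
    # paper's method, which is unspecified in the abstract.
    class ViewRateMonitor:
        def __init__(self, alpha=0.1, threshold=5.0):
            self.alpha = alpha          # smoothing factor for the baseline rate
            self.threshold = threshold  # flag if rate > threshold * baseline
            self.baseline = None        # exponentially weighted mean view rate

        def update(self, views_this_interval):
            """Feed one interval's view count; return True if it looks suspicious."""
            rate = float(views_this_interval)
            if self.baseline is None:
                self.baseline = rate
                return False
            suspicious = self.baseline > 0 and rate > self.threshold * self.baseline
            # update the baseline after the check so a burst cannot mask itself
            self.baseline = self.alpha * rate + (1 - self.alpha) * self.baseline
            return suspicious

    monitor = ViewRateMonitor()
    for count in [20, 25, 18, 22, 400, 24]:  # toy hourly view counts
        if monitor.update(count):
            print("possible fake-view burst:", count)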

    Kernel method for nonlinear Granger causality

    Important information on the structure of complex systems, consisting of more than one component, can be obtained by measuring to what extent the individual components exchange information among each other. Such knowledge is needed to reach a deeper comprehension of phenomena ranging from turbulent fluids to neural networks, as well as complex physiological signals. The linear Granger approach to detecting cause-effect relationships between time series has emerged in recent years as a leading statistical technique to accomplish this task. Here we generalize Granger causality to the nonlinear case using the theory of reproducing kernel Hilbert spaces. Our method performs linear Granger causality in the feature space of suitable kernel functions, allowing for an arbitrary degree of nonlinearity. We develop a new strategy to cope with the problem of overfitting, based on the geometry of reproducing kernel Hilbert spaces. Applications to coupled chaotic maps and physiological data sets are presented. Comment: Revised version, accepted for publication in Physical Review Letters
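
    As a concrete picture of the approach, the sketch below runs the restricted-versus-full regression comparison behind Granger causality with a kernel regressor. It is a simplification under stated assumptions: off-the-shelf kernel ridge regression (whose ridge penalty, not the paper's RKHS-geometry strategy, controls overfitting), a fixed embedding order, and a simple holdout error estimate on a toy coupled pair rather than chaotic maps or physiological data.

    import numpy as np
    from sklearn.kernel_ridge import KernelRidge

    def kernel_granger(x, y, order=2, alpha=1e-2, gamma=1.0):
        """Causality index for x -> y: the drop in prediction error of y when
        lagged x is added to lagged y as a regressor. Positive values suggest
        that x Granger-causes y in this nonlinear, kernelized sense."""
        n = len(y)
        # delay-embedded design matrices (order = number of lags)
        Y_lags = np.column_stack([y[k:n - order + k] for k in range(order)])
        X_lags = np.column_stack([x[k:n - order + k] for k in range(order)])
        target = y[order:]

        def holdout_error(features):
            half = len(target) // 2  # simple holdout split
            model = KernelRidge(kernel="rbf", alpha=alpha, gamma=gamma)
            model.fit(features[:half], target[:half])
            resid = target[half:] - model.predict(features[half:])
            return np.mean(resid ** 2)

        err_restricted = holdout_error(Y_lags)                 # y's past only
        err_full = holdout_error(np.hstack([Y_lags, X_lags]))  # plus x's past
        return np.log(err_restricted / err_full)

    # toy pair: y is driven by the past of x, but not the other way around
    rng = np.random.default_rng(0)
    x = rng.standard_normal(500)
    y = np.roll(np.tanh(x), 1) + 0.1 * rng.standard_normal(500)
    print("x -> y:", kernel_granger(x, y))  # clearly positive
    print("y -> x:", kernel_granger(y, x))  # near zero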

    Improving classification for brain computer interfaces using transitions and a moving window

    Proceedings of: Biosignals 2009, International Conference on Bio-inspired Systems and Signal Processing, BIOSTEC 2009, Porto (Portugal), 14-17 January 2009.

    The context of this paper is the brain-computer interface (BCI), and in particular the classification of signals with machine learning methods. In this paper we intend to improve classification accuracy by taking advantage of a feature of BCIs: instances run in sequences belonging to the same class. In that case, the classification problem can be reformulated into two subproblems: detecting class transitions and determining the class for sequences of instances between transitions. We detect a transition when the Euclidean distance between the power spectra at two different times is larger than a threshold. To tackle the second problem, instances are classified by taking into account not just the prediction for that instance, but a moving window of predictions for previous instances. Experimental results show that our transition detection method improves results for the datasets of two out of three subjects of BCI Competition III. If the moving window is used, classification accuracy is further improved, depending on the window size.
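
    Both steps lend themselves to a short sketch. The distance test on consecutive power spectra follows the description above; the moving-window step is filled in with a majority vote over recent predictions, which is our assumption, since the abstract does not say how the window of predictions is aggregated.

    import numpy as np

    def detect_transitions(spectra, threshold):
        """spectra: (T, F) array of power spectra over time. Flag time t as a
        class transition when the Euclidean distance between consecutive
        spectra exceeds the threshold, as described in the abstract."""
        dists = np.linalg.norm(np.diff(spectra, axis=0), axis=1)
        return np.flatnonzero(dists > threshold) + 1  # start of each new segment

    def smooth_predictions(preds, window=5):
        """Replace each per-instance prediction with the majority vote over
        the last `window` predictions (our assumed aggregation rule)."""
        preds = np.asarray(preds)
        out = preds.copy()
        for t in range(len(preds)):
            vals, counts = np.unique(preds[max(0, t - window + 1):t + 1],
                                     return_counts=True)
            out[t] = vals[np.argmax(counts)]
        return out

    spectra = np.vstack([np.ones((5, 8)), 3 * np.ones((5, 8))])  # toy spectra
    print(detect_transitions(spectra, threshold=2.0))  # -> [5]
    print(smooth_predictions([0, 0, 1, 0, 0, 1, 1, 1, 0, 1], window=3))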

    Geometrical complexity of data approximators

    Many methods have been developed to approximate a cloud of vectors embedded in high-dimensional space by simpler objects: from principal points and linear manifolds to self-organizing maps, neural gas, elastic maps, various types of principal curves and principal trees, and so on. For each type of approximator, a measure of the approximator's complexity has been developed too. These measures are necessary to find the balance between accuracy and complexity and to define the optimal approximations of a given type. We propose a measure of complexity (geometrical complexity) which is applicable to approximators of several types and which allows comparing data approximations of different types. Comment: 10 pages, 3 figures, minor correction and extension

    Subsampling in Smoothed Range Spaces

    We consider smoothed versions of geometric range spaces, so an element of the ground set (e.g. a point) can be contained in a range with a non-binary value in $[0,1]$. Similar notions have been considered for kernels; we extend them to more general types of ranges. We then consider approximations of these range spaces through $\varepsilon$-nets and $\varepsilon$-samples (a.k.a. $\varepsilon$-approximations). We characterize when size bounds for $\varepsilon$-samples on kernels can be extended to these more general smoothed range spaces. We also describe new generalizations of $\varepsilon$-nets to these range spaces and show when results from binary range spaces can carry over to these smoothed ones. Comment: This is the full version of the paper which appeared in ALT 2015. 16 pages, 3 figures. In Algorithmic Learning Theory, pp. 224-238. Springer International Publishing, 2015
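
    For intuition, here is a small numerical illustration of these definitions, assuming a Gaussian kernel as the smoothed range (one of the kernel instances mentioned above): each ground-set point gets a membership value in $[0,1]$, and a subsample $Q$ is an $\varepsilon$-sample if its average membership tracks that of the full ground set to within $\varepsilon$ for every range. The bandwidth, sample sizes, and uniform sampling below are placeholder choices, not values from the paper.

    import numpy as np

    def smoothed_range(points, center, bandwidth=1.0):
        """Non-binary membership in [0, 1] of each point in the range at
        `center`, using a Gaussian kernel (an assumed instance)."""
        d2 = np.sum((points - center) ** 2, axis=1)
        return np.exp(-d2 / (2 * bandwidth ** 2))

    rng = np.random.default_rng(1)
    P = rng.uniform(0, 10, size=(100_000, 2))             # ground set
    Q = P[rng.choice(len(P), size=2_000, replace=False)]  # uniform subsample

    # An eps-sample Q must satisfy, for every range R:
    #   | avg membership over P - avg membership over Q | <= eps
    worst = 0.0
    for center in rng.uniform(0, 10, size=(50, 2)):  # spot-check 50 ranges
        err = abs(smoothed_range(P, center).mean()
                  - smoothed_range(Q, center).mean())
        worst = max(worst, err)
    print(f"worst observed discrepancy over the sampled ranges: {worst:.4f}")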

    Optimal estimation for Large-Eddy Simulation of turbulence and application to the analysis of subgrid models

    The tools of optimal estimation are applied to the study of subgrid models for Large-Eddy Simulation of turbulence. The concept of an optimal estimator is introduced and its properties are analyzed in the context of applications to a priori tests of subgrid models. Attention is focused on the Cook and Riley model in the case of a scalar field in isotropic turbulence. Using DNS data, the relevance of the beta assumption is estimated by computing (i) generalized optimal estimators and (ii) the error introduced by this assumption alone. Optimal estimators are computed for the subgrid variance using various sets of variables and various techniques (histograms and neural networks). It is shown that optimal estimators allow a thorough exploration of models. Neural networks prove to be relevant and very efficient in this framework, and further uses are suggested
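
    In the mean-square sense, the optimal estimator of a subgrid quantity Y from a set of resolved variables X is the conditional average E[Y|X], and no model built on X alone can beat its residual error. The sketch below estimates it with the histogram technique mentioned above; the synthetic scalar data stand in for DNS fields (which we do not have), and the bin count and toy target are arbitrary.

    import numpy as np

    def optimal_estimator_histogram(x, y, n_bins=50):
        """Bin edges and the per-bin conditional mean of y given x."""
        edges = np.linspace(x.min(), x.max(), n_bins + 1)
        idx = np.clip(np.digitize(x, edges) - 1, 0, n_bins - 1)
        cond_mean = np.full(n_bins, np.nan)
        for b in range(n_bins):
            mask = idx == b
            if mask.any():
                cond_mean[b] = y[mask].mean()
        return edges, cond_mean

    # synthetic stand-in for a (resolved variable, subgrid quantity) pair
    rng = np.random.default_rng(2)
    x = rng.uniform(-1, 1, 50_000)
    y = np.sin(3 * x) + 0.2 * rng.standard_normal(50_000)
    edges, cond_mean = optimal_estimator_histogram(x, y)
    idx = np.clip(np.digitize(x, edges) - 1, 0, len(cond_mean) - 1)
    # residual of the optimal estimator = irreducible error of ANY model
    # that uses only x as input
    print("irreducible MSE estimate:", np.mean((y - cond_mean[idx]) ** 2))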

    A preliminary approach to the multilabel classification problem of Portuguese juridical documents

    Portuguese juridical documents from Supreme Courts and the Attorney General's Office are manually classified by juridical experts into a set of classes belonging to a taxonomy of concepts. In this paper, a preliminary approach to developing techniques for automatically classifying these juridical documents is proposed. As a basic strategy, natural language processing techniques are integrated with machine learning ones. Support Vector Machines (SVM) are used as the learning algorithm, and the obtained results are presented and compared with those of other approaches, such as C4.5 and Naive Bayes
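
    A minimal sketch of the learning stage described above, assuming TF-IDF features and a one-vs-rest linear SVM for the multilabel output; the document snippets, taxonomy labels, and hyperparameters are invented placeholders, and the paper's natural-language preprocessing of Portuguese text is not reproduced here.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.multiclass import OneVsRestClassifier
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import MultiLabelBinarizer
    from sklearn.svm import LinearSVC

    # hypothetical document snippets and taxonomy labels, for illustration only
    docs = ["acordao sobre contrato de arrendamento",
            "parecer sobre direito penal e recurso",
            "contrato de trabalho e despedimento"]
    labels = [["civil", "arrendamento"], ["penal"], ["trabalho", "civil"]]

    binarizer = MultiLabelBinarizer()
    Y = binarizer.fit_transform(labels)  # one indicator column per class

    # TF-IDF features + one linear SVM per class (one-vs-rest multilabel)
    model = make_pipeline(TfidfVectorizer(),
                          OneVsRestClassifier(LinearSVC()))
    model.fit(docs, Y)

    pred = model.predict(["recurso penal sobre despedimento"])
    print(binarizer.inverse_transform(pred))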